DNA encoding carbonic anhydrase

ABSTRACT

A cloned DNA encoding carbonic anhydrase of a monocotyledon is disclosed.

This is a U.S. National Stage Application of PCT/JP94/01814 filed Oct. 27,1994.

TECHNICAL FIELD

The present invention relates to a novel DNA which encodes carbonic anhydrase of a monocotyledon.

BACKGROUND ART

Carbonic anhydrase (carbonic dehydratase) is an enzyme widely occurring in animals and plants, which catalyzes the following reaction.

    CO.sub.2 +H.sub.2 O←→H.sup.+ +HCO.sub.3.sup.-

In C₃ plants, it is thought that carbonic anhydrase plays a role in preventing evaporation of CO₂ from chloroplasts by converting CO₂ to carbonate ion. One of substrates of ribulose bisphosphate carboxylase (Rubisco) which is an enzyme for carbon dioxide fixation is CO₂. Thus, it is thought that carbonic anhydrase supplies the substrate of Rubisco. Localization of carbonic anhydrase in cells of higher plants varies depending on the type of photosynthesis of the plant. In C₃ plants, carbonic anhydrase activity is found in chloroplasts and in C₄ plants, carbonic anhydrase activity is mainly found in cytoplasm of mesophyll cells.

As mentioned above, carbonic anhydrase catalyzes the reaction by which equilibrium between CO₂ and hydrogen carbonate ion (HCO₃ ⁻) in a solution is maintained. Although this equilibrium is reached under natural conditions, it takes a long time to reach the equilibrium if the enzyme does not participate. Therefore, if this enzyme is introduced by genetic engineering technique into a C₃ plant in which the enzyme is not localized in cytoplasm, it is thought that the reaction to reach the equilibrium between CO₂ and HCO₃ ⁻ is promoted and so the substrate of the enzyme carrying out carbon dioxide fixation is efficiently supplied, so that the ability to carry out carbon dioxide fixation of the plant is promoted.

Recently, it was reported that phosphoenol pyruvate carboxylase (PEPC) which is an enzyme catalyzing the first carbon dioxide fixation reaction of C₄ plants was introduced into a C₃ plant by genetic engineering technique (Hudspeth, R. L. et al., (1992), Plant Physiol. 98:485-464; Katsura IZUI et al., (1993), Plant Cell Technology 5: 74-82). The substrate of this enzyme is HCO₃ ⁻. Since carbonic anhydrase does not exist in cytoplasm of C₃ plants, in order that PEPC expressed in the cytoplasm efficiently functions, it is necessary to sufficiently supply HCO₃ ⁻. Thus, if carbonic anhydrase is introduced into the plant to which PEPC has been introduced, it is thought that HCO₃ ⁻ consumed by the enzyme reaction of PEPC is supplied to cytoplasm, so that the ability of carbon dioxide fixation of the plant can be further promoted.

Carbonic anhydrase genes of dicotyledons such as spinach (Burnell, J. N. et al., Plant Physiol 92:37-40 (1990); Fawcett, T. W. et al., J. Biol. Chem. 265:5414-5417), pea (Roeske, C. A. et al., Nucleic Acid Res. 18:3413 (1990); Majeau, N. et al., Plant Physiol. 95:264-268 (1991)), Arabidopsis thaliama (Raines, C. A. et al., Plant Mol. Biol. 20:1143-1148 (1992)) and tobacco (Majeau, N. et al., EMBL Nucleotide Sequence Databases, Accession No. M94135, 1992)) have been isolated and sequenced. However, since the carbonic anhydrases of monocotyledons have enzyme properties different from those of dicotyledons, it is expected that greater effects will be obtained by introducing a carbonic anhydrase gene of a monocotyledon to monocotyledons.

As for carbonic anhydrase genes of monocotyledons, maize carbonic anhydrase gene has been partially sequenced (Keith et al., Plant Physiol. 101:329-332 (1993)). However, the sequenced region is only 210 bp. It is thought that this is too short to encode an active carbonic anhydrase and so cannot be used for genetic manipulation of monocotyledons.

DISCLOSURE OF THE INVENTION

Accordingly, the object of the present invention is to provide a gene encoding a carbonic anhydrase of a monocotyledon.

The present inventors intensively studied to succeed in cloning maize carbonic anhydrase cDNA from maize cDNA library using carbonic anhydrase cDNA of spinach as a probe, and sequencing the cloned gene. The present inventors further succeeded in cloning rice carbonic anhydrase cDNA from rice cDNA library using the thus obtained maize carbonic anhydrase cDNA as a probe, and sequencing the cloned gene, thereby completing the present invention.

That is, the present invention provides a cloned DNA which encodes carbonic anhydrase of a monocotyledon.

The present invention also provides a cloned DNA encoding the amino acid sequence shown in SEQ ID NO. 1, 4, 6 or 8 in Sequence Listing or the same amino acid sequence as shown in SEQ ID NO. 1, 4, 6 or 8 in Sequence Listing except that one or more amino acid is added, deleted or substituted, said amino acid sequence give enzyme activity of carbonic anhydrase.

By the present invention, a cloned DNA which encodes carbonic anhydrase of a monocotyledon was first provided. It is expected that by transforming a monocotyledon with this gene, the ability of carbon dioxide fixation of the plant can be promoted, so that growth of the plant can be accelerated.

BEST MODE FOR CARRYING OUT THE INVENTION

The DNA according to the present invention encodes carbonic anhydrase. Examples thereof include DNAs encoding amino acid sequences of maize carbonic anhydrases, which are shown in SEQ ID NOs. 1, 6 and 8, and the DNA encoding the amino acid sequence of rice carbonic anhydrase, which is shown in SEQ ID NO. 4. The amino acid sequences shown in SEQ ID NOs. 1, 6, 8 and 4 were determined in the examples described below. The nucleotide sequences of the DNAs isolated in the examples described below are shown in SEQ ID NOs. 2, 7, 9 and 5. The amino acid sequences shown in SEQ ID NOs. 2, 7, 9 and 5 are shown in SEQ ID NOs. 1, 6, 8 and 4, respectively. It should be noted that the amino acid sequences shown in SEQ ID NOs. 1, 6, 8 and 4 were also first determined by the present invention.

It is well-known in the art that there are cases wherein the activity of an enzyme is retained even if the amino acid sequence of an enzyme is modified to a small extent, that is, even if one or more amino acids in the amino acid sequence are substituted or deleted, or even if one or more amino acids are added to the amino acid sequence. DNAs encoding the proteins having such modifications and having carbonic anhydrase activity are included within the scope of the present invention. That is, cloned DNAs encoding amino acid sequences having the same amino acid sequence as SEQ ID NO. 1, 4, 6 or 8 except that one or more amino acids are substituted, deleted or added, which give the enzyme activity of carbonic anhydrase, are also included in the scope of the present invention. Similarly, DNAs having the same nucleotide sequence as SEQ ID NO. 2, 5, 7 or 9 except that one or more nucleotides are substituted, deleted or added, which encodes an amino acid sequence giving the enzyme activity of carbonic anhydrase are also included within the scope of the present invention.

Modification of DNA which brings about addition, deletion or substitution of the amino acid sequence encoded thereby can be attained by the site-specific mutagenesis which is well-known in the art (e.g., Nucleic Acid Research, Vol. 10, No. 20, p6487-6500, 1982). In the present specification, "one or more amino acids" means the number of amino acids which can be added, deleted or substituted by the site-specific mutagenesis.

Site-specific mutagenesis may be carried out by, for example, using a synthetic oligonucleotide primer complementary to a single-stranded phage DNA except that the desired mutation as follows. That is, using the above-mentioned synthetic oligonucleotide as a primer, a complementary chain is produced by a phage, and host bacterial cells are transformed with the obtained double-stranded DNA. The culture of the transformed bacterial cells is plated on agar and plaques are formed from a single cell containing the phage. Theoretically, 50% of the new colonies contain the phage having a single-stranded chain carrying the mutation and remaining 50% of the colonies contain the phage having the original sequence. The obtained plaques are then subjected to hybridization with a kinase-treated synthetic probe at a temperature at which the probe is hybridized with the DNA having exactly the same sequence as the DNA having the desired mutation but not with the original DNA sequence that is not completely complementary with the probe. Then the plaques in which the hybridization was observed are picked up, cultured and the DNA is collected.

In addition to the above-mentioned site-specific mutagenesis, the methods for substituting, deleting or adding one or more amino acids without losing the enzyme activity include a method in which the gene is treated with a mutagen and a method in which the gene is selectively cleaved, a selected nucleotide is removed, added or substituted and then the gene is ligated.

The DNAs according to the present invention may be obtained by the methods described in detail in the examples below. Alternatively, since the nucleotide sequences were determined by the present invention, the DNAs according to the present invention can be easily obtained by the PCR method utilizing the genome of maize or rice as a template and also by so called RT-PCR method using their RNAs as a template.

By inserting the DNA according to the present invention into an expression vector for plants by a conventional method and by transforming a monocotyledon with the obtained recombinant vector, carbonic anhydrase can be expressed in the monocotyledon, thereby promoting the ability of carbon dioxide fixation of the plant and, in turn, accelerating the growth of the plant.

The present invention will now be described in more detail by way of examples. It should be noted that the present invention is not restricted to the examples.

EXAMPLE 1

Isolation of Maize Carbonic Anhydrase cDNA

From green leaves of maize, RNAs were extracted and polyA⁺ RNAs were isolated using DYNABEADS (commercially available from BERITUS) according to the instructions by the manufacturer. According to a conventional method, phage-infected bacterial cells were plated on a medium and plaques obtained by culturing the plate at 37° C. were transferred to a nylon membrane (Hybond N⁺, commercially available from AMERSHAM). The library was screened by using a probe obtained by labelling the EcoRI fragment (790 bp) of spinach carbonic anhydrase cDNA (Burnell et al., (1990) Plant Physiol. 92:37-40) with α-³² P!dCTP (commercially available from AMERSHAM) by using Gigaprime Labelling kit (commercially available from Bresatec, Adelaide, Australia), and positive clones were selected. Hybridization was performed at 42° C. for 16-24 hours in a solution containing 6×SSPE, 5×Denhalt's solution, 0.5% (w/v) SDS, 100 μg/ml of herring sperm DNA, 10 mM phosphate buffer (pH 7.0) and 50% (v/v) formamide, to which the probe labelled with ³² P was added. The membranes were then washed in 2×SSC containing 0.1% (w/v) SDS at room temperature for 30 minutes and then with 1×SSC containing 0.1% (w/v) SDS at 60° C. for 30 minutes. The membranes were then subjected to autoradiography and positive clones were selected. The inserts of the obtained clones were subcloned into pTZ18R and sequenced by dideoxy method. The determined sequence is shown in SEQ ID. NO. 2. This sequence has a homology of 60.3% with the EcoRI fragment of the spinach carbonic anhydrase used as a probe, and has a homology of 98.8% with the reported maize cDNA fragment having a homology with the gene encoding pea chloroplast type carbonic anhydrase.

EXAMPLE 2

Isolation of Rice Carbonic Anhydrase cDNA

(1) Purification of Rice Carbonic Anhydrase and Determination of Amino Acid Sequence of N-terminal

One hundred grams of rice leaves cultivated under long day regimen was ground with 300 ml of extraction buffer (50 mM Hepes-KOH pH 7.5, 10 mM MgSO₄, 1 mM EDTA, 20 mM 2-mercaptoethanol). The resultant was filtered through two layers of MIRACLOTH (commercially available from KARBIOCHEM), and the filtrate was centrifuged at 30,000×g for 20 minutes to remove insoluble materials, thereby obtaining a crude extract. The crude extract was fractioned by sodium sulfate of 40-60% saturation (0° C.). The obtained precipitate was dissolved in a column buffer (20 mM Hepes-KOH pH 7.5, 20 mM 2-mercaptoethanol) and applied to preliminarily equilibrated Sephadex G25 (commercially available from Pharmacia) column (inner diameter 2.5 cm×35 cm), thereby carrying out desalination. The desalinated crude extract was applied to preliminarily equilibrated DEAE-Cellulose 52 (commercially available from WHATMAN) column (inner diameter 2.5 cm×20 cm). After sufficiently washing the column with the column buffer, the adsorbed proteins were eluted by linear gradient of KCl from 0 to 0.3M. The fractions exhibiting carbonic anhydrase activity were combined and solid ammonium sulfate was added to a concentration of 65% (0° C.) to precipitate the proteins. The generated precipitate was dissolved in 3 ml of column buffer and the solution was applied to Sepharose CL-6B (commercially available from Pharmacia) column (inner diameter 2.5 cm×96 cm) preliminarily equilibrated with the column buffer, thereby fractioning the proteins. Among the eluted fractions, the fraction exhibiting the highest carbonic anhydrase activity was subjected to SDS-polyacrylamide gel electrophoresis and the band of carbonic anhydrase protein was identified by Western blotting using anti-maize carbonic anhydrase polyclonal antibody. On the other hand, the isolated protein after the SDS-polyacrylamide gel electrophoresis was electrically transferred to a PVDF membrane (commercially available from Millipore), and the band of carbonic anhydrase was cut out. The amino acid sequence of N-terminal region of the protein was determined by gas phase Edman degradation method using 447A Protein Sequencer commercially available from Applied Biosystems. The determined amino acid sequence is shown in SEQ ID. NO. 3.

(2) Isolation of Rice Carbonic Anhydrase cDNA

From green leaves of maize, RNAs were extracted and polyA³⁰ RNAs were isolated using DYNABEADS (commercially available from BERITUS) according to the instructions by the manufacturer. A cDNA library employing as a vector a phage vector called λZapII vector using cDNA synthesis kit and direct cloning kit which are commercially available from Pharmacia. The library was screened by using a probe obtained by labelling the maize carbonic anhydrase cDNA fragment (1.8 kb) with α-³² P!dCTP (commercially available from AMERSHAM) by using Gigaprime Labelling kit (commercially available from Bresatec, Adelaide, Australia), and positive clones were selected. Hybridization was performed at 42° C. for 16-24 hours in a solution containing 6×SSPE, 5×Denhalt's solution, 0.5% (w/v) SDS, 100 μg/ml herring sperm DNA, 10 mM phosphate buffer (pH 7.0) and 50% (v/v) formamide, to which the probe labelled with ³² P was added. The membrane was then washed in 2×SSC containing 0.1% (w/v) SDS at room temperature for 30 minutes and then with 1×SSC containing 0.1% (w/v) SDS at 60° C. for 30 minutes. The membrane was then subjected to autoradiography and positive clones were selected. The obtained clones were subcloned into a vector pBluescript by in vivo excision method and then sequenced by dideoxy method using T7 Sequence kit (commercially available from Pharmacia) (SEQ ID. NO. 5). In the amino acid sequence deduced from this nucleotide sequence, a region identical to the amino acid sequence of the N-terminal region determined in (1) existed. Therefore, the obtained cDNA clone was judged to be a gene encoding carbonic anhydrase.

EXAMPLE 3

Isolation of Maize Carbonic Anhydrase cDNA

The same procedure as in Example 1 was repeated except that 5'-end region (EcoRI-BstXI fragment, 135 bp) of the maize carbonic anhydrase cDNA obtained in Example 1 was used as the probe in place of spinach carbonic anhydrase cDNA. As a result, two carbonic anhydrase cDNA clones (CAI and CAII) were obtained. Although these cDNAs have very high homologies with the maize carbonic anhydrase cDNA obtained above, they are not completely identical. The nucleotide sequences of CAI and CAII as well as deduced amino acid sequences are shown in SEQ ID NOs. 7 and 9, respectively.

INDUSTRIAL AVAILABILITY

Since the DNAs according to the present invention encode carbonic anhydrase of monocotyledons, it is expected that by transforming a monocotyledons with the DNA, the ability of carbon dioxide fixation of the plant may be promoted and growth of the plant may be accelerated.

    __________________________________________________________________________     #             SEQUENCE LISTING     - (1) GENERAL INFORMATION:     -    (iii) NUMBER OF SEQUENCES: 9     - (2) INFORMATION FOR SEQ ID NO:1:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 651 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:     - Tyr Thr Leu Pro Val Arg Thr Thr Thr Ser Se - #r Ile Val Pro Ala Cys     #                 15     - His Pro Arg Ala Val Leu Leu Leu Arg Leu Ar - #g Pro Pro Gly Ser Gly     #             30     - Ser Ser Gly Thr Pro Arg Leu Arg Arg Pro Al - #a Thr Val Val Gly Met     #         45     - Asp Pro Thr Val Glu Arg Leu Lys Ser Gly Ph - #e Gln Lys Phe Lys Thr     #     60     - Glu Val Tyr Asp Lys Lys Pro Glu Leu Phe Gl - #u Pro Leu Lys Ser Gly     # 80     - Gln Ser Pro Arg Tyr Met Val Phe Ala Cys Se - #r Asp Ser Arg Val Cys     #                 95     - Pro Ser Val Thr Leu Gly Leu Gln Pro Gly Gl - #u Ala Phe Thr Val Arg     #           110     - Asn Ile Ala Ser Met Val Pro Pro Tyr Asp Ly - #s Ile Lys Tyr Ala Gly     #       125     - Thr Gly Ser Ala Ile Glu Tyr Ala Val Cys Al - #a Leu Lys Val Gln Val     #   140     - Ile Val Val Ile Gly His Ser Cys Cys Gly Gl - #y Ile Arg Ala Leu Leu     145                 1 - #50                 1 - #55                 1 -     #60     - Ser Leu Lys Asp Gly Ala Pro Asp Asn Phe Th - #r Phe Val Glu Asp Trp     #               175     - Val Arg Ile Gly Ser Pro Ala Lys Asn Lys Va - #l Lys Lys Glu His Ala     #           190     - Ser Val Pro Phe Asp Asp Gln Cys Ser Ile Le - #u Glu Lys Glu Ala Val     #       205     - Asn Val Ser Leu Gln Asn Leu Lys Ser Tyr Pr - #o Phe Val Lys Glu Gly     #   220     - Leu Ala Gly Gly Thr Leu Lys Leu Val Gly Al - #a His Tyr Ser Phe Val     225                 2 - #30                 2 - #35                 2 -     #40     - Lys Gly Gln Phe Val Thr Trp Glu Pro Pro Gl - #n Asp Ala Ile Glu Arg     #               255     - Leu Thr Ser Gly Phe Gln Gln Phe Lys Val As - #n Val Tyr Asp Lys Lys     #           270     - Pro Glu Leu Phe Gly Pro Leu Lys Ser Gly Gl - #n Ala Pro Lys Tyr Met     #       285     - Val Phe Ala Cys Ser Asp Ser Arg Val Cys Pr - #o Ser Val Thr Leu Gly     #   300     - Leu Gln Pro Ala Lys Ala Phe Thr Val Arg As - #n Ile Ala Ala Met Val     305                 3 - #10                 3 - #15                 3 -     #20     - Pro Gly Tyr Asp Lys Thr Lys Tyr Thr Gly Il - #e Gly Ser Ala Ile Glu     #               335     - Tyr Ala Val Cys Ala Leu Lys Val Glu Val Le - #u Val Val Ile Gly His     #           350     - Ser Cys Cys Gly Gly Ile Arg Ala Leu Leu Se - #r Leu Lys Asp Gly Ala     #       365     - Pro Asp Asn Phe His Phe Val Glu Asp Trp Va - #l Arg Ile Gly Ser Pro     #   380     - Ala Lys Asn Lys Val Lys Lys Glu His Ala Se - #r Val Pro Phe Asp Asp     385                 3 - #90                 3 - #95                 4 -     #00     - Gln Cys Ser Ile Leu Glu Lys Glu Ala Val As - #n Val Ser Leu Gln Asn     #               415     - Leu Lys Ser Tyr Pro Leu Val Lys Glu Gly Le - #u Ala Gly Gly Thr Ser     #           430     - Ser Gly Trp Pro His Tyr Asp Phe Val Lys Gl - #y Gln Phe Val Thr Trp     #       445     - Glu Pro Pro Gln Asp Ala Ile Glu Arg Leu Th - #r Ser Gly Phe Gln Gln     #   460     - Phe Lys Val Asn Val Tyr Asp Lys Lys Pro Gl - #u Leu Phe Gly Pro Leu     465                 4 - #70                 4 - #75                 4 -     #80     - Lys Ser Gly Gln Ala Pro Lys Tyr Met Val Ph - #e Ala Cys Ser Asp Ser     #               495     - Arg Val Cys Pro Ser Val Thr Leu Pro Ala Al - #a Gly Glu Ala Phe Thr     #           510     - Val Arg Asn Ile Ala Ala Met Val Gln Gly Ty - #r Asp Lys Thr Lys Tyr     #       525     - Thr Gly Ile Gly Ser Ala Ile Glu Tyr Ala Va - #l Cys Ala Leu Lys Val     #   540     - Glu Val Leu Val Val Ile Gly His Ser Cys Cy - #s Gly Gly Ile Arg Ala     545                 5 - #50                 5 - #55                 5 -     #60     - Leu Leu Ser Leu Gln Asp Gly Ala Pro Asp Th - #r Phe His Phe Val Glu     #               575     - Asp Trp Val Lys Ile Ala Phe Ile Ala Lys Me - #t Lys Val Lys Lys Glu     #           590     - His Ala Ser Val Pro Phe Asp Asp Gln Trp Se - #r Ile Leu Glu Lys Glu     #       605     - Ala Val Asn Val Ser Leu Glu Asn Leu Lys Th - #r Tyr Pro Phe Val Lys     #   620     - Glu Gly Leu Ala Asn Gly Thr Leu Lys Leu Il - #e Gly Ala His Tyr Asp     625                 6 - #30                 6 - #35                 6 -     #40     - Phe Val Ser Gly Glu Phe Leu Thr Trp Lys Ly - #s     #               650     - (2) INFORMATION FOR SEQ ID NO:2:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 2178 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA to mRNA     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..1953     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:     - TAC ACA TTG CCC GTC CGT ACC ACC ACA TCC AG - #C ATC GTG CCA GCC TGC       48     Tyr Thr Leu Pro Val Arg Thr Thr Thr Ser Se - #r Ile Val Pro Ala Cys     #                 15     - CAC CCC CGC GCC GTC CTC CTC CTC CGG CTC CG - #G CCC CCA GGC TCA GGC       96     His Pro Arg Ala Val Leu Leu Leu Arg Leu Ar - #g Pro Pro Gly Ser Gly     #             30     - TCA TCC GGA ACG CCC CGT CTT CGC CGC CCC GC - #C ACC GTC GTG GGC ATG      144     Ser Ser Gly Thr Pro Arg Leu Arg Arg Pro Al - #a Thr Val Val Gly Met     #         45     - GAC CCC ACC GTC GAG CGC TTG AAG AGC GGG TT - #C CAG AAG TTC AAG ACC      192     Asp Pro Thr Val Glu Arg Leu Lys Ser Gly Ph - #e Gln Lys Phe Lys Thr     #     60     - GAG GTC TAT GAC AAG AAG CCG GAG CTG TTC GA - #G CCT CTC AAG TCC GGC      240     Glu Val Tyr Asp Lys Lys Pro Glu Leu Phe Gl - #u Pro Leu Lys Ser Gly     # 80     - CAG AGC CCC AGG TAC ATG GTG TTC GCC TGC TC - #C GAC TCC CGC GTG TGC      288     Gln Ser Pro Arg Tyr Met Val Phe Ala Cys Se - #r Asp Ser Arg Val Cys     #                 95     - CCG TCG GTG ACA CTG GGA CTG CAG CCC GGC GA - #G GCA TTC ACC GTC CGC      336     Pro Ser Val Thr Leu Gly Leu Gln Pro Gly Gl - #u Ala Phe Thr Val Arg     #           110     - AAC ATC GCT TCC ATG GTC CCA CCC TAC GAC AA - #G ATC AAG TAC GCC GGC      384     Asn Ile Ala Ser Met Val Pro Pro Tyr Asp Ly - #s Ile Lys Tyr Ala Gly     #       125     - ACA GGG TCC GCC ATC GAG TAC GCC GTG TGC GC - #G CTC AAG GTG CAG GTC      432     Thr Gly Ser Ala Ile Glu Tyr Ala Val Cys Al - #a Leu Lys Val Gln Val     #   140     - ATC GTG GTC ATT GGC CAC AGC TGC TGC GGT GG - #C ATC AGG GCG CTC CTC      480     Ile Val Val Ile Gly His Ser Cys Cys Gly Gl - #y Ile Arg Ala Leu Leu     145                 1 - #50                 1 - #55                 1 -     #60     - TCC CTC AAG GAC GGC GCG CCC GAC AAC TTC AC - #C TTC GTG GAG GAC TGG      528     Ser Leu Lys Asp Gly Ala Pro Asp Asn Phe Th - #r Phe Val Glu Asp Trp     #               175     - GTC AGG ATC GGC AGC CCT GCC AAG AAC AAG GT - #G AAG AAA GAG CAC GCG      576     Val Arg Ile Gly Ser Pro Ala Lys Asn Lys Va - #l Lys Lys Glu His Ala     #           190     - TCC GTG CCG TTC GAT GAC CAG TGC TCC ATC CT - #G GAG AAG GAG GCC GTG      624     Ser Val Pro Phe Asp Asp Gln Cys Ser Ile Le - #u Glu Lys Glu Ala Val     #       205     - AAC GTG TCG CTC CAG AAC CTC AAG AGC TAC CC - #C TTC GTC AAG GAA GGG      672     Asn Val Ser Leu Gln Asn Leu Lys Ser Tyr Pr - #o Phe Val Lys Glu Gly     #   220     - CTG GCC GGC GGG ACG CTC AAG CTG GTT GGC GC - #C CAC TAC AGC TTC GTC      720     Leu Ala Gly Gly Thr Leu Lys Leu Val Gly Al - #a His Tyr Ser Phe Val     225                 2 - #30                 2 - #35                 2 -     #40     - AAA GGG CAG TTC GTC ACA TGG GAG CCT CCC CA - #G GAC GCC ATC GAG CGC      768     Lys Gly Gln Phe Val Thr Trp Glu Pro Pro Gl - #n Asp Ala Ile Glu Arg     #               255     - TTG ACG AGC GGC TTC CAG CAG TTC AAG GTC AA - #T GTC TAT GAC AAG AAG      816     Leu Thr Ser Gly Phe Gln Gln Phe Lys Val As - #n Val Tyr Asp Lys Lys     #           270     - CCG GAG CTT TTC GGG CCT CTC AAG TCC GGC CA - #G GCC CCC AAG TAC ATG      864     Pro Glu Leu Phe Gly Pro Leu Lys Ser Gly Gl - #n Ala Pro Lys Tyr Met     #       285     - GTG TTC GCC TGC TCC GAC TCC CGT GTG TGC CC - #G TCG GTG ACC CTG GGC      912     Val Phe Ala Cys Ser Asp Ser Arg Val Cys Pr - #o Ser Val Thr Leu Gly     #   300     - CTG CAG CCC GCG AAG GCC TTC ACC GTT CGC AA - #C ATC GCC GCC ATG GTC      960     Leu Gln Pro Ala Lys Ala Phe Thr Val Arg As - #n Ile Ala Ala Met Val     305                 3 - #10                 3 - #15                 3 -     #20     - CCA GGC TAC GAC AAG ACC AAG TAC ACC GGC AT - #C GGG TCC GCC ATC GAG     1008     Pro Gly Tyr Asp Lys Thr Lys Tyr Thr Gly Il - #e Gly Ser Ala Ile Glu     #               335     - TAC GCT GTG TGC GCC CTC AAG GTG GAG GTC CT - #C GTG GTC ATT GGC CAT     1056     Tyr Ala Val Cys Ala Leu Lys Val Glu Val Le - #u Val Val Ile Gly His     #           350     - AGC TGC TGC GGT GGC ATC AGG GCG CTC CTC TC - #C CTC AAG GAC GGC GCG     1104     Ser Cys Cys Gly Gly Ile Arg Ala Leu Leu Se - #r Leu Lys Asp Gly Ala     #       365     - CCC GAC AAC TTC CAC TTC GTG GAG GAC TGG GT - #C AGG ATC GGC AGC CCT     1152     Pro Asp Asn Phe His Phe Val Glu Asp Trp Va - #l Arg Ile Gly Ser Pro     #   380     - GCC AAG AAC AAG GTG AAG AAA GAG CAC GCG TC - #C GTG CCG TTC GAT GAC     1200     Ala Lys Asn Lys Val Lys Lys Glu His Ala Se - #r Val Pro Phe Asp Asp     385                 3 - #90                 3 - #95                 4 -     #00     - CAG TGC TCC ATC CTG GAG AAG GAG GCC GTG AA - #C GTG TCG CTC CAG AAC     1248     Gln Cys Ser Ile Leu Glu Lys Glu Ala Val As - #n Val Ser Leu Gln Asn     #               415     - CTC AAG AGC TAC CCC TTG GTC AAG GAA GGG CT - #G GCC GGC GGG ACG TCA     1296     Leu Lys Ser Tyr Pro Leu Val Lys Glu Gly Le - #u Ala Gly Gly Thr Ser     #           430     - AGT GGT TGG CCC CAC TAC GAC TTC GTT AAA GG - #G CAG TTC GTC ACA TGG     1344     Ser Gly Trp Pro His Tyr Asp Phe Val Lys Gl - #y Gln Phe Val Thr Trp     #       445     - GAG CCT CCC CAG GAC GCC ATC GAG CGC TTG AC - #G AGC GGC TTC CAG CAG     1392     Glu Pro Pro Gln Asp Ala Ile Glu Arg Leu Th - #r Ser Gly Phe Gln Gln     #   460     - TTC AAG GTC AAT GTC TAT GAC AAG AAG CCG GA - #G CTT TTC GGG CCT CTC     1440     Phe Lys Val Asn Val Tyr Asp Lys Lys Pro Gl - #u Leu Phe Gly Pro Leu     465                 4 - #70                 4 - #75                 4 -     #80     - AAG TCC GGC CAG GCC CCC AAG TAC ATG GTG TT - #C GCC TGC TCC GAC TCC     1488     Lys Ser Gly Gln Ala Pro Lys Tyr Met Val Ph - #e Ala Cys Ser Asp Ser     #               495     - CGT GTG TGC CCG TCG GTG ACC CTG CCT GCA GC - #C GGC GAG GCC TTC ACC     1536     Arg Val Cys Pro Ser Val Thr Leu Pro Ala Al - #a Gly Glu Ala Phe Thr     #           510     - GTT CGC AAC ATC GCC GCC ATG GTC CAG GGC TA - #C GAC AAG ACC AAG TAC     1584     Val Arg Asn Ile Ala Ala Met Val Gln Gly Ty - #r Asp Lys Thr Lys Tyr     #       525     - ACC GGC ATC GGG TCC GCC ATC GAG TAC GCT GT - #G TGC GCC CTC AAG GTG     1632     Thr Gly Ile Gly Ser Ala Ile Glu Tyr Ala Va - #l Cys Ala Leu Lys Val     #   540     - GAG GTC CTC GTG GTC ATT GGC CAT AGC TGC TG - #C GGT GGC ATC AGG GCG     1680     Glu Val Leu Val Val Ile Gly His Ser Cys Cy - #s Gly Gly Ile Arg Ala     545                 5 - #50                 5 - #55                 5 -     #60     - CTC CTC TCA CTC CAG GAC GGC GCA CCT GAC AC - #C TTC CAC TTC GTC GAG     1728     Leu Leu Ser Leu Gln Asp Gly Ala Pro Asp Th - #r Phe His Phe Val Glu     #               575     - GAC TGG GTT AAG ATC GCC TTC ATT GCC AAG AT - #G AAG GTA AAG AAA GAG     1776     Asp Trp Val Lys Ile Ala Phe Ile Ala Lys Me - #t Lys Val Lys Lys Glu     #           590     - CAC GCC TCG GTG CCG TTC GAT GAC CAG TGG TC - #C ATT CTC GAG AAG GAG     1824     His Ala Ser Val Pro Phe Asp Asp Gln Trp Se - #r Ile Leu Glu Lys Glu     #       605     - GCC GTG AAC GTG TCC CTG GAG AAC CTC AAG AC - #C TAC CCC TTC GTC AAG     1872     Ala Val Asn Val Ser Leu Glu Asn Leu Lys Th - #r Tyr Pro Phe Val Lys     #   620     - GAA GGG CTT GCA AAT GGG ACC CTC AAG CTG AT - #C GGC GCC CAC TAC GAC     1920     Glu Gly Leu Ala Asn Gly Thr Leu Lys Leu Il - #e Gly Ala His Tyr Asp     625                 6 - #30                 6 - #35                 6 -     #40     - TTT GTC TCA GGA GAG TTC CTC ACA TGG AAA AA - #G TGAAAAACTA GGGCTTTCCG     1973     Phe Val Ser Gly Glu Phe Leu Thr Trp Lys Ly - #s     #               650     - TTAAGATGGC CGGGCGGCTG AGGACGTAGT AGTATTTATA TATTACTCTA TA - #ACTATACT     2033     - ACTACGTACC TACCGATATG CACCCGAGCA ATGTGAATGC GTCGAGTACT AT - #CTGTTTTC     2093     - TGCATCTACA TATATATACC GGATCAACAA TCGCCCAATG TGAATGTAAT AA - #GCAATATC     2153     #             2178 CATT CCTAA     - (2) INFORMATION FOR SEQ ID NO:3:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 23 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: Not R - #elevant               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:     - Ala Ala Pro Val Ala Pro Ala Ala Met Asp Al - #a Ala Val Asp Arg Leu     #                15     - Xaa Asp Gly Phe Ala Lys Phe                 20     - (2) INFORMATION FOR SEQ ID NO:4:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 272 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:     - Met Ser Thr Ala Ala Ala Ala Ala Ala Ala Gl - #n Ser Trp Cys Phe Ala     #                 15     - Thr Val Thr Pro Arg Ser Arg Ala Thr Val Va - #l Ala Ser Leu Ala Ser     #             30     - Pro Ser Pro Ser Ser Ser Ser Ser Ser Ser As - #n Ser Ser Asn Leu Pro     #         45     - Ala Pro Phe Arg Pro Arg Leu Ile Arg Asn Th - #r Pro Val Phe Ala Ala     #     60     - Pro Val Ala Pro Ala Ala Met Asp Ala Ala Va - #l Asp Arg Leu Lys Asp     # 80     - Gly Phe Ala Lys Phe Lys Thr Glu Phe Tyr As - #p Lys Lys Pro Glu Leu     #                 95     - Phe Glu Pro Leu Lys Ala Gly Gln Ala Pro Ly - #s Tyr Met Val Phe Ser     #           110     - Cys Ala Asp Ser Arg Val Cys Pro Ser Val Th - #r Met Gly Leu Glu Pro     #       125     - Gly Glu Ala Phe Thr Val Arg Asn Ile Ala As - #n Met Val Pro Ala Tyr     #   140     - Cys Lys Ile Lys His Ala Gly Val Gly Ser Al - #a Ile Glu Tyr Ala Val     145                 1 - #50                 1 - #55                 1 -     #60     - Cys Ala Leu Lys Val Glu Leu Ile Val Val Il - #e Gly His Ser Arg Cys     #               175     - Gly Gly Ile Lys Ala Leu Leu Ser Leu Lys As - #p Gly Ala Pro Asp Ser     #           190     - Phe His Phe Val Glu Asp Trp Val Arg Thr Gl - #y Phe Pro Ala Lys Lys     #       205     - Lys Val Gln Thr Glu His Ala Ser Leu Pro Ph - #e Asp Asp Gln Cys Ala     #   220     - Ile Leu Glu Lys Glu Ala Val Asn Gln Ser Le - #u Glu Asn Leu Lys Thr     225                 2 - #30                 2 - #35                 2 -     #40     - Tyr Pro Phe Val Lys Glu Gly Ile Ala Asn Gl - #y Thr Leu Lys Leu Val     #               255     - Gly Gly His Tyr Asp Phe Val Ser Gly Asn Le - #u Asp Leu Trp Glu Pro     #           270     - (2) INFORMATION FOR SEQ ID NO:5:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1167 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA to mRNA     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 36..851     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:     #GCC GCC GCC        53T GCACCGCCTC TCACA ATG TCG ACC     #   Met Ser Thr Ala Ala Ala     #               655     - GCC GCC GCT GCC CAG AGC TGG TGC TTC GCC AC - #T GTC ACC CCG CGC TCC      101     Ala Ala Ala Ala Gln Ser Trp Cys Phe Ala Th - #r Val Thr Pro Arg Ser     #       670     - CGC GCC ACA GTC GTC GCC AGC CTC GCC TCC CC - #A TCA CCG TCC TCC TCC      149     Arg Ala Thr Val Val Ala Ser Leu Ala Ser Pr - #o Ser Pro Ser Ser Ser     #   685     - TCC TCC TCC TCC AAC AGC AGC AAC CTC CCG GC - #C CCC TTC CGC CCC CGC      197     Ser Ser Ser Ser Asn Ser Ser Asn Leu Pro Al - #a Pro Phe Arg Pro Arg     690                 6 - #95                 7 - #00                 7 -     #05     - CTC ATC CGC AAC ACC CCC GTC TTC GCC GCC CC - #C GTC GCC CCC GCC GCG      245     Leu Ile Arg Asn Thr Pro Val Phe Ala Ala Pr - #o Val Ala Pro Ala Ala     #               720     - ATG GAC GCC GCC GTC GAC CGC CTC AAG GAT GG - #G TTC GCC AAG TTC AAG      293     Met Asp Ala Ala Val Asp Arg Leu Lys Asp Gl - #y Phe Ala Lys Phe Lys     #           735     - ACC GAG TTC TAT GAC AAG AAG CCG GAG CTC TT - #C GAG CCG CTC AAG GCC      341     Thr Glu Phe Tyr Asp Lys Lys Pro Glu Leu Ph - #e Glu Pro Leu Lys Ala     #       750     - GGC CAG GCA CCC AAG TAC ATG GTG TTC TCG TG - #C GCC GAC TCT CGC GTG      389     Gly Gln Ala Pro Lys Tyr Met Val Phe Ser Cy - #s Ala Asp Ser Arg Val     #   765     - TGC CCG TCG GTG ACC ATG GGC CTG GAG CCC GG - #C GAG GCC TTC ACC GTC      437     Cys Pro Ser Val Thr Met Gly Leu Glu Pro Gl - #y Glu Ala Phe Thr Val     770                 7 - #75                 7 - #80                 7 -     #85     - CGC AAC ATC GCC AAC ATG GTC CCA GCT TAC TG - #C AAG ATC AAG CAC GCT      485     Arg Asn Ile Ala Asn Met Val Pro Ala Tyr Cy - #s Lys Ile Lys His Ala     #               800     - GGC GTC GGG TCG GCC ATC GAG TAC GCC GTC TG - #C GCC CTC AAG GTC GAA      533     Gly Val Gly Ser Ala Ile Glu Tyr Ala Val Cy - #s Ala Leu Lys Val Glu     #           815     - CTC ATC GTG GTG ATT GGC CAC AGC CGC TGC GG - #T GGA ATC AAG GCC CTC      581     Leu Ile Val Val Ile Gly His Ser Arg Cys Gl - #y Gly Ile Lys Ala Leu     #       830     - CTC TCA CTC AAG GAT GGA GCA CCA GAC TCC TT - #C CAC TTC GTC GAG GAC      629     Leu Ser Leu Lys Asp Gly Ala Pro Asp Ser Ph - #e His Phe Val Glu Asp     #   845     - TGG GTC AGG ACC GGT TTC CCC GCC AAG AAG AA - #G GTT CAG ACC GAG CAC      677     Trp Val Arg Thr Gly Phe Pro Ala Lys Lys Ly - #s Val Gln Thr Glu His     850                 8 - #55                 8 - #60                 8 -     #65     - GCC TCG CTG CCT TTC GAT GAC CAA TGC GCC AT - #C TTG GAG AAG GAG GCC      725     Ala Ser Leu Pro Phe Asp Asp Gln Cys Ala Il - #e Leu Glu Lys Glu Ala     #               880     - GTG AAC CAA TCC CTG GAG AAC CTC AAG ACC TA - #C CCG TTC GTC AAG GAG      773     Val Asn Gln Ser Leu Glu Asn Leu Lys Thr Ty - #r Pro Phe Val Lys Glu     #           895     - GGG ATC GCC AAC GGC ACC CTC AAG CTC GTC GG - #C GGC CAC TAC GAC TTC      821     Gly Ile Ala Asn Gly Thr Leu Lys Leu Val Gl - #y Gly His Tyr Asp Phe     #       910     - GTC TCC GGC AAC TTG GAC TTA TGG GAG CCC TA - #AATCCGAC CGTCCGTCCG      871     Val Ser Gly Asn Leu Asp Leu Trp Glu Pro     #   920     - TTCAGTTCGT CAGTTTACGC CAACGCTTTT GCATAAGTAC TACCTGAGGA TA - #TCGTCCCC      931     - GATCATCGAT GTGAACGCGT GGAGTACTAC TACGTACGTA CCGGATGGTT CG - #ATATATGT      991     - GAATGCTGTA TTAAGTAATA ACAAGAAATA TATCTCCTCT ACTTTTTCCT GA - #CGCGGAGT     1051     - TGTACTGCCT ATGATGCATA ATTTGATCGC AGTGTGATCA AAAGACATCA GC - #TATAATGT     1111     - CTTAATAATA TTATTATGAA GAGTTTACCT TTTTACTAAA AAAAAAAAAA AA - #AAAA     1167     - (2) INFORMATION FOR SEQ ID NO:6:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 655 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:     - Met Tyr Thr Leu Pro Val Arg Ala Thr Thr Se - #r Ser Ile Val Ala Ser     #                 15     - Leu Ala Thr Pro Ala Pro Ser Ser Ser Ser Gl - #y Ser Gly Arg Pro Arg     #             30     - Leu Arg Leu Ile Arg Asn Ala Pro Val Phe Al - #a Ala Pro Ala Thr Val     #         45     - Val Gly Met Asp Pro Thr Val Glu Arg Leu Ly - #s Ser Gly Phe Gln Lys     #     60     - Phe Lys Thr Glu Val Tyr Asp Lys Lys Pro Gl - #u Leu Phe Glu Pro Leu     # 80     - Lys Ser Gly Gln Ser Pro Arg Tyr Met Val Ph - #e Ala Cys Ser Asp Ser     #                 95     - Arg Val Cys Pro Ser Val Thr Leu Gly Leu Gl - #n Pro Gly Glu Ala Phe     #           110     - Thr Val Arg Asn Ile Ala Ser Met Val Pro Pr - #o Tyr Asp Lys Ile Lys     #       125     - Tyr Ala Gly Thr Gly Ser Ala Ile Glu Tyr Al - #a Val Cys Ala Leu Lys     #   140     - Val Gln Val Ile Val Val Ile Gly His Ser Cy - #s Cys Gly Gly Ile Arg     145                 1 - #50                 1 - #55                 1 -     #60     - Ala Leu Leu Ser Leu Lys Asp Gly Ala Pro As - #p Asn Phe Thr Phe Val     #               175     - Glu Asp Trp Val Arg Ile Gly Ser Pro Ala Ly - #s Asn Lys Val Lys Lys     #           190     - Glu His Ala Ser Val Pro Phe Asp Asp Gln Cy - #s Ser Ile Leu Glu Lys     #       205     - Glu Ala Val Asn Val Ser Leu Gln Asn Leu Ly - #s Ser Tyr Pro Phe Val     #   220     - Lys Glu Gly Leu Ala Gly Gly Thr Leu Lys Le - #u Val Gly Ala His Tyr     225                 2 - #30                 2 - #35                 2 -     #40     - Ser Phe Val Lys Gly Gln Phe Val Thr Trp Gl - #u Pro Pro Gln Asp Ala     #               255     - Ile Glu Arg Leu Thr Ser Gly Phe Gln Gln Ph - #e Lys Val Asn Val Tyr     #           270     - Asp Lys Lys Pro Glu Leu Phe Gly Pro Leu Ly - #s Ser Gly Gln Ala Pro     #       285     - Lys Tyr Met Val Phe Ala Cys Ser Asp Ser Ar - #g Val Cys Pro Ser Val     #   300     - Thr Leu Gly Leu Gln Pro Ala Lys Ala Phe Th - #r Val Arg Asn Ile Ala     305                 3 - #10                 3 - #15                 3 -     #20     - Ala Met Val Pro Gly Tyr Asp Lys Thr Lys Ty - #r Thr Gly Ile Gly Ser     #               335     - Ala Ile Glu Tyr Ala Val Cys Ala Leu Lys Va - #l Glu Val Leu Val Val     #           350     - Ile Gly His Ser Cys Cys Gly Gly Ile Arg Al - #a Leu Leu Ser Leu Lys     #       365     - Asp Gly Ala Pro Asp Asn Phe His Phe Val Gl - #u Asp Trp Val Arg Ile     #   380     - Gly Ser Pro Ala Lys Asn Lys Val Lys Lys Gl - #u His Ala Ser Val Pro     385                 3 - #90                 3 - #95                 4 -     #00     - Phe Asp Asp Gln Cys Ser Ile Leu Glu Lys Gl - #u Ala Val Asn Val Ser     #               415     - Leu Gln Asn Leu Lys Ser Tyr Pro Leu Val Ly - #s Glu Gly Leu Ala Gly     #           430     - Gly Thr Ser Ser Gly Trp Pro His Tyr Asp Ph - #e Val Lys Gly Gln Phe     #       445     - Val Thr Trp Glu Pro Pro Gln Asp Ala Ile Gl - #u Arg Leu Thr Ser Gly     #   460     - Phe Gln Gln Phe Lys Val Asn Val Tyr Asp Ly - #s Lys Pro Glu Leu Phe     465                 4 - #70                 4 - #75                 4 -     #80     - Gly Pro Leu Lys Ser Gly Gln Ala Pro Lys Ty - #r Met Val Phe Ala Cys     #               495     - Ser Asp Ser Arg Val Ser Pro Ser Val Thr Le - #u Gly Leu Gln Pro Gly     #           510     - Glu Ala Phe Thr Val Arg Asn Ile Ala Ala Me - #t Val Pro Gly Tyr Asp     #       525     - Lys Thr Lys Tyr Thr Gly Ile Gly Ser Ala Il - #e Glu Tyr Ala Val Cys     #   540     - Ala Leu Lys Val Glu Val Leu Val Val Ile Gl - #y His Ser Cys Cys Gly     545                 5 - #50                 5 - #55                 5 -     #60     - Gly Ile Arg Ala Leu Leu Ser Leu Gln Asp Gl - #y Ala Pro Asp Thr Phe     #               575     - His Phe Val Glu Asp Trp Val Lys Ile Ala Ph - #e Ile Ala Lys Met Lys     #           590     - Val Lys Lys Glu His Ala Ser Val Pro Phe As - #p Asp Gln Trp Ser Ile     #       605     - Leu Glu Lys Glu Ala Val Asn Val Ser Leu Gl - #u Asn Leu Lys Thr Tyr     #   620     - Pro Phe Val Lys Glu Gly Leu Ala Asn Gly Th - #r Leu Lys Leu Ile Gly     625                 6 - #30                 6 - #35                 6 -     #40     - Ala His Tyr Asp Phe Val Ser Gly Glu Phe Le - #u Thr Trp Lys Lys     #               655     - (2) INFORMATION FOR SEQ ID NO:7:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 2190 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: other nucleic acid     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..1965     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:     - ATG TAC ACA TTG CCC GTC CGT GCC ACC ACA TC - #C AGC ATC GTC GCC AGC       48     Met Tyr Thr Leu Pro Val Arg Ala Thr Thr Se - #r Ser Ile Val Ala Ser     #       285     - CTC GCC ACC CCC GCG CCG TCC TCC TCC TCC GG - #C TCC GGC CGC CCC AGG       96     Leu Ala Thr Pro Ala Pro Ser Ser Ser Ser Gl - #y Ser Gly Arg Pro Arg     #   300     - CTC AGG CTC ATC CGG AAC GCC CCC GTC TTC GC - #C GCC CCC GCC ACC GTC      144     Leu Arg Leu Ile Arg Asn Ala Pro Val Phe Al - #a Ala Pro Ala Thr Val     305                 3 - #10                 3 - #15                 3 -     #20     - GTG GGC ATG GAC CCC ACC GTC GAG CGC TTG AA - #G AGC GGG TTC CAG AAG      192     Val Gly Met Asp Pro Thr Val Glu Arg Leu Ly - #s Ser Gly Phe Gln Lys     #               335     - TTC AAG ACC GAG GTC TAT GAC AAG AAG CCG GA - #G CTG TTC GAG CCT CTC      240     Phe Lys Thr Glu Val Tyr Asp Lys Lys Pro Gl - #u Leu Phe Glu Pro Leu     #           350     - AAG TCC GGC CAG AGC CCC AGG TAC ATG GTG TT - #C GCC TGC TCC GAC TCC      288     Lys Ser Gly Gln Ser Pro Arg Tyr Met Val Ph - #e Ala Cys Ser Asp Ser     #       365     - CGC GTG TGC CCG TCG GTG ACA CTG GGA CTG CA - #G CCC GGC GAG GCA TTC      336     Arg Val Cys Pro Ser Val Thr Leu Gly Leu Gl - #n Pro Gly Glu Ala Phe     #   380     - ACC GTC CGC AAC ATC GCT TCC ATG GTC CCA CC - #C TAC GAC AAG ATC AAG      384     Thr Val Arg Asn Ile Ala Ser Met Val Pro Pr - #o Tyr Asp Lys Ile Lys     385                 3 - #90                 3 - #95                 4 -     #00     - TAC GCC GGC ACA GGG TCC GCC ATC GAG TAC GC - #C GTG TGC GCG CTC AAG      432     Tyr Ala Gly Thr Gly Ser Ala Ile Glu Tyr Al - #a Val Cys Ala Leu Lys     #               415     - GTG CAG GTC ATC GTG GTC ATT GGC CAC AGC TG - #C TGC GGT GGC ATC AGG      480     Val Gln Val Ile Val Val Ile Gly His Ser Cy - #s Cys Gly Gly Ile Arg     #           430     - GCG CTC CTC TCC CTC AAG GAC GGC GCG CCC GA - #C AAC TTC ACC TTC GTG      528     Ala Leu Leu Ser Leu Lys Asp Gly Ala Pro As - #p Asn Phe Thr Phe Val     #       445     - GAG GAC TGG GTC AGG ATC GGC AGC CCT GCC AA - #G AAC AAG GTG AAG AAA      576     Glu Asp Trp Val Arg Ile Gly Ser Pro Ala Ly - #s Asn Lys Val Lys Lys     #   460     - GAG CAC GCG TCC GTG CCG TTC GAT GAC CAG TG - #C TCC ATC CTG GAG AAG      624     Glu His Ala Ser Val Pro Phe Asp Asp Gln Cy - #s Ser Ile Leu Glu Lys     465                 4 - #70                 4 - #75                 4 -     #80     - GAG GCC GTG AAC GTG TCG CTC CAG AAC CTC AA - #G AGC TAC CCC TTC GTC      672     Glu Ala Val Asn Val Ser Leu Gln Asn Leu Ly - #s Ser Tyr Pro Phe Val     #               495     - AAG GAA GGG CTG GCC GGC GGG ACG CTC AAG CT - #G GTT GGC GCC CAC TAC      720     Lys Glu Gly Leu Ala Gly Gly Thr Leu Lys Le - #u Val Gly Ala His Tyr     #           510     - AGC TTC GTC AAA GGG CAG TTC GTC ACA TGG GA - #G CCT CCC CAG GAC GCC      768     Ser Phe Val Lys Gly Gln Phe Val Thr Trp Gl - #u Pro Pro Gln Asp Ala     #       525     - ATC GAG CGC TTG ACG AGC GGC TTC CAG CAG TT - #C AAG GTC AAT GTC TAT      816     Ile Glu Arg Leu Thr Ser Gly Phe Gln Gln Ph - #e Lys Val Asn Val Tyr     #   540     - GAC AAG AAG CCG GAG CTT TTC GGG CCT CTC AA - #G TCC GGC CAG GCC CCC      864     Asp Lys Lys Pro Glu Leu Phe Gly Pro Leu Ly - #s Ser Gly Gln Ala Pro     545                 5 - #50                 5 - #55                 5 -     #60     - AAG TAC ATG GTG TTC GCC TGC TCC GAC TCC CG - #T GTG TGC CCG TCG GTG      912     Lys Tyr Met Val Phe Ala Cys Ser Asp Ser Ar - #g Val Cys Pro Ser Val     #               575     - ACC CTG GGC CTG CAG CCC GCG AAG GCC TTC AC - #C GTT CGC AAC ATC GCC      960     Thr Leu Gly Leu Gln Pro Ala Lys Ala Phe Th - #r Val Arg Asn Ile Ala     #           590     - GCC ATG GTC CCA GGC TAC GAC AAG ACC AAG TA - #C ACC GGC ATC GGG TCC     1008     Ala Met Val Pro Gly Tyr Asp Lys Thr Lys Ty - #r Thr Gly Ile Gly Ser     #       605     - GCC ATC GAG TAC GCT GTG TGC GCC CTC AAG GT - #G GAG GTC CTC GTG GTC     1056     Ala Ile Glu Tyr Ala Val Cys Ala Leu Lys Va - #l Glu Val Leu Val Val     #   620     - ATT GGC CAT AGC TGC TGC GGT GGC ATC AGG GC - #G CTC CTC TCC CTC AAG     1104     Ile Gly His Ser Cys Cys Gly Gly Ile Arg Al - #a Leu Leu Ser Leu Lys     625                 6 - #30                 6 - #35                 6 -     #40     - GAC GGC GCG CCC GAC AAC TTC CAC TTC GTG GA - #G GAC TGG GTC AGG ATC     1152     Asp Gly Ala Pro Asp Asn Phe His Phe Val Gl - #u Asp Trp Val Arg Ile     #               655     - GGC AGC CCT GCC AAG AAC AAG GTG AAG AAA GA - #G CAC GCG TCC GTG CCG     1200     Gly Ser Pro Ala Lys Asn Lys Val Lys Lys Gl - #u His Ala Ser Val Pro     #           670     - TTC GAT GAC CAG TGC TCC ATC CTG GAG AAG GA - #G GCC GTG AAC GTG TCG     1248     Phe Asp Asp Gln Cys Ser Ile Leu Glu Lys Gl - #u Ala Val Asn Val Ser     #       685     - CTC CAG AAC CTC AAG AGC TAC CCC TTG GTC AA - #G GAA GGG CTG GCC GGC     1296     Leu Gln Asn Leu Lys Ser Tyr Pro Leu Val Ly - #s Glu Gly Leu Ala Gly     #   700     - GGG ACG TCA AGT GGT TGG CCC CAC TAC GAC TT - #C GTT AAA GGG CAG TTC     1344     Gly Thr Ser Ser Gly Trp Pro His Tyr Asp Ph - #e Val Lys Gly Gln Phe     705                 7 - #10                 7 - #15                 7 -     #20     - GTC ACA TGG GAG CCT CCC CAG GAC GCC ATC GA - #G CGC TTG ACG AGC GGC     1392     Val Thr Trp Glu Pro Pro Gln Asp Ala Ile Gl - #u Arg Leu Thr Ser Gly     #               735     - TTC CAG CAG TTC AAG GTC AAT GTC TAT GAC AA - #G AAG CCG GAG CTT TTC     1440     Phe Gln Gln Phe Lys Val Asn Val Tyr Asp Ly - #s Lys Pro Glu Leu Phe     #           750     - GGG CCT CTC AAG TCC GGC CAG GCC CCC AAG TA - #C ATG GTG TTC GCC TGC     1488     Gly Pro Leu Lys Ser Gly Gln Ala Pro Lys Ty - #r Met Val Phe Ala Cys     #       765     - TCC GAC TCC CGT GTG TCC CCG TCG GTG ACC CT - #G GGC CTG CAG CCC GGC     1536     Ser Asp Ser Arg Val Ser Pro Ser Val Thr Le - #u Gly Leu Gln Pro Gly     #   780     - GAG GCC TTC ACC GTT CGC AAC ATC GCC GCC AT - #G GTC CCC GGC TAC GAC     1584     Glu Ala Phe Thr Val Arg Asn Ile Ala Ala Me - #t Val Pro Gly Tyr Asp     785                 7 - #90                 7 - #95                 8 -     #00     - AAG ACC AAG TAC ACC GGC ATC GGG TCC GCC AT - #C GAG TAC GCT GTG TGC     1632     Lys Thr Lys Tyr Thr Gly Ile Gly Ser Ala Il - #e Glu Tyr Ala Val Cys     #               815     - GCC CTC AAG GTG GAG GTC CTC GTG GTC ATT GG - #C CAT AGC TGC TGC GGT     1680     Ala Leu Lys Val Glu Val Leu Val Val Ile Gl - #y His Ser Cys Cys Gly     #           830     - GGC ATC AGG GCG CTC CTC TCA CTC CAG GAC GG - #C GCA CCT GAC ACC TTC     1728     Gly Ile Arg Ala Leu Leu Ser Leu Gln Asp Gl - #y Ala Pro Asp Thr Phe     #       845     - CAC TTC GTC GAG GAC TGG GTT AAG ATC GCC TT - #C ATT GCC AAG ATG AAG     1776     His Phe Val Glu Asp Trp Val Lys Ile Ala Ph - #e Ile Ala Lys Met Lys     #   860     - GTA AAG AAA GAG CAC GCC TCG GTG CCG TTC GA - #T GAC CAG TGG TCC ATT     1824     Val Lys Lys Glu His Ala Ser Val Pro Phe As - #p Asp Gln Trp Ser Ile     865                 8 - #70                 8 - #75                 8 -     #80     - CTC GAG AAG GAG GCC GTG AAC GTG TCC CTG GA - #G AAC CTC AAG ACC TAC     1872     Leu Glu Lys Glu Ala Val Asn Val Ser Leu Gl - #u Asn Leu Lys Thr Tyr     #               895     - CCC TTC GTC AAG GAA GGG CTT GCA AAT GGG AC - #C CTC AAG CTG ATC GGC     1920     Pro Phe Val Lys Glu Gly Leu Ala Asn Gly Th - #r Leu Lys Leu Ile Gly     #           910     - GCC CAC TAC GAC TTT GTC TCA GGA GAG TTC CT - #C ACA TGG AAA AAG     1965     Ala His Tyr Asp Phe Val Ser Gly Glu Phe Le - #u Thr Trp Lys Lys     #       925     - TGAAAAACTA GGGCTAAGGC AATTCTACCG GCCCGCCGAC TCCTGCATCA TC - #ATAAATAT     2025     - ATATACTCTA TAACTATACT ACTACGTACC TACCGATATG CACCCGAGCA AT - #GTGAATGC     2085     - GTCGAGTACT ATCTGTTTTC TGCATCTACA TATATATACC GGATCAACAA TC - #GCCCAATG     2145     #                2190TC ATTTTCTACC ACTTTTCATT CCTAA     - (2) INFORMATION FOR SEQ ID NO:8:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 546 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:     - Met Tyr Thr Leu Pro Val Arg Ala Thr Thr Se - #r Ser Ile Val Ala Ser     #                 15     - Leu Ala Thr Pro Ala Pro Ser Ser Ser Ser Gl - #y Ser Gly Arg Pro Arg     #             30     - Leu Arg Leu Ile Arg Asn Ala Pro Val Phe Al - #a Ala Pro Ala Thr Val     #         45     - Cys Lys Arg Asp Gly Gly Gln Leu Arg Ser Gl - #n Thr Arg Glu Ile Glu     #     60     - Arg Glu Arg Lys Gly Gly His Pro Pro Ala Gl - #y Gly His Lys Arg Gly     # 80     - Gly Glu Arg Gly Gln Arg Arg Gly Gly Glu Gl - #u Glu Glu Asp Glu Gln     #                 95     - Leu Pro Leu Pro Ser Glu Lys Lys Gly Gly Al - #a Ser Glu Gly Glu Ala     #           110     - Val His Arg Tyr Pro His Leu Val Thr Pro Se - #r Glu Pro Glu Ala Leu     #       125     - Gln Pro Pro Pro Pro Pro Ser Lys Ala Ser Se - #r Lys Gly Met Asp Pro     #   140     - Thr Val Glu Arg Leu Lys Ser Gly Phe Gln Ly - #s Phe Lys Thr Glu Val     145                 1 - #50                 1 - #55                 1 -     #60     - Tyr Asp Lys Lys Pro Glu Leu Phe Glu Pro Le - #u Lys Ser Gly Gln Ser     #               175     - Pro Arg Tyr Met Val Phe Ala Cys Ser Asp Se - #r Arg Val Cys Pro Ser     #           190     - Val Thr Leu Gly Leu Gln Pro Gly Glu Ala Ph - #e Thr Val Arg Asn Ile     #       205     - Ala Ser Met Val Pro Pro Tyr Asp Lys Ile Ly - #s Tyr Ala Gly Thr Gly     #   220     - Ser Ala Ile Glu Tyr Ala Val Cys Ala Leu Ly - #s Val Gln Val Ile Val     225                 2 - #30                 2 - #35                 2 -     #40     - Val Ile Gly His Ser Cys Cys Gly Gly Ile Ar - #g Ala Leu Leu Ser Leu     #               255     - Lys Asp Gly Ala Pro Asp Asn Phe Thr Phe Va - #l Glu Asp Trp Val Arg     #           270     - Ile Gly Ser Pro Ala Lys Asn Lys Val Lys Ly - #s Glu His Ala Ser Val     #       285     - Pro Phe Asp Asp Gln Cys Ser Ile Leu Glu Ly - #s Glu Ala Val Asn Val     #   300     - Ser Leu Gln Asn Leu Lys Ser Tyr Pro Phe Va - #l Lys Glu Gly Leu Ala     305                 3 - #10                 3 - #15                 3 -     #20     - Gly Gly Thr Leu Lys Leu Val Gly Ala His Se - #r His Phe Val Lys Gly     #               335     - Gln Phe Val Thr Trp Glu Pro Pro Gln Asp Al - #a Ile Glu Arg Leu Thr     #           350     - Ser Gly Phe Gln Gln Phe Lys Val Asn Val Ty - #r Asp Lys Lys Pro Glu     #       365     - Leu Phe Gly Pro Leu Lys Ser Gly Gln Ala Pr - #o Lys Tyr Met Val Phe     #   380     - Ala Cys Ser Asp Ser Arg Val Cys Pro Ser Va - #l Thr Leu Gly Leu Gln     385                 3 - #90                 3 - #95                 4 -     #00     - Pro Gly Glu Ala Phe Thr Val Arg Asn Ile Al - #a Ala Met Val Pro Gly     #               415     - Tyr Asp Lys Thr Lys Tyr Thr Gly Ile Gly Se - #r Ala Ile Glu Tyr Ala     #           430     - Val Cys Ala Leu Lys Val Glu Val Leu Val Va - #l Ile Gly His Ser Cys     #       445     - Cys Gly Gly Ile Arg Ala Leu Leu Ser Leu Gl - #n Gly Thr Gly Ala Ala     #   460     - Tyr Thr Phe His Phe Val Glu Asp Trp Val Ly - #s Ile Gly Phe Ile Ala     465                 4 - #70                 4 - #75                 4 -     #80     - Lys Met Lys Val Lys Lys Glu His Ala Ser Va - #l Pro Phe Asp Asp Gln     #               495     - Cys Ser Ile Leu Glu Lys Glu Ala Val Asn Va - #l Ser Leu Glu Asn Leu     #           510     - Lys Thr Tyr Pro Phe Val Lys Glu Gly Leu Al - #a Asn Gly Thr Leu Lys     #       525     - Leu Ile Gly Ala His Tyr Asp Phe Val Ser Gl - #y Glu Phe Leu Thr Trp     #   540     - Lys Lys     545     - (2) INFORMATION FOR SEQ ID NO:9:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1935 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: other nucleic acid     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..1638     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:     - ATG TAC ACA TTG CCC GTC CGT GCC ACC ACA TC - #C AGC ATC GTC GCC AGC       48     Met Tyr Thr Leu Pro Val Arg Ala Thr Thr Se - #r Ser Ile Val Ala Ser     #               670     - CTC GCC ACC CCC GCG CCG TCC TCC TCC TCC GG - #C TCC GGC CGC CCC AGG       96     Leu Ala Thr Pro Ala Pro Ser Ser Ser Ser Gl - #y Ser Gly Arg Pro Arg     #           685     - CTC AGG CTC ATC CGG AAC GCC CCC GTC TTC GC - #C GCC CCC GCC ACC GTC      144     Leu Arg Leu Ile Arg Asn Ala Pro Val Phe Al - #a Ala Pro Ala Thr Val     #       700     - TGT AAA CGG GAC GGC GGG CAG CTG AGG AGT CA - #A ACG AGA GAG ATC GAG      192     Cys Lys Arg Asp Gly Gly Gln Leu Arg Ser Gl - #n Thr Arg Glu Ile Glu     #   715     - AGA GAA AGA AAG GGA GGG CAT CCA CCA GCC GG - #C GGG CAT AAG AGG GGA      240     Arg Glu Arg Lys Gly Gly His Pro Pro Ala Gl - #y Gly His Lys Arg Gly     720                 7 - #25                 7 - #30                 7 -     #35     - GGA GAG AGA GGC CAG AGA AGA GGA GGA GAA GA - #A GAA GAA GAT GAG CAG      288     Gly Glu Arg Gly Gln Arg Arg Gly Gly Glu Gl - #u Glu Glu Asp Glu Gln     #               750     - CTG CCT CTG CCT TCC GAA AAA AAA GGA GGG GC - #C AGC GAA GGA GAA GCC      336     Leu Pro Leu Pro Ser Glu Lys Lys Gly Gly Al - #a Ser Glu Gly Glu Ala     #           765     - GTC CAC AGA TAC CCC CAC CTC GTC ACT CCT TC - #A GAA CCA GAA GCC CTC      384     Val His Arg Tyr Pro His Leu Val Thr Pro Se - #r Glu Pro Glu Ala Leu     #       780     - CAA CCT CCA CCT CCT CCC TCC AAG GCT TCC TC - #C AAG GGC ATG GAC CCC      432     Gln Pro Pro Pro Pro Pro Ser Lys Ala Ser Se - #r Lys Gly Met Asp Pro     #   795     - ACC GTC GAG CGC TTG AAG AGC GGG TTC CAG AA - #G TTC AAG ACC GAG GTC      480     Thr Val Glu Arg Leu Lys Ser Gly Phe Gln Ly - #s Phe Lys Thr Glu Val     800                 8 - #05                 8 - #10                 8 -     #15     - TAT GAC AAG AAG CCG GAG CTG TTC GAG CCT CT - #C AAG TCC GGC CAG AGC      528     Tyr Asp Lys Lys Pro Glu Leu Phe Glu Pro Le - #u Lys Ser Gly Gln Ser     #               830     - CCC AGG TAC ATG GTG TTC GCC TGC TCC GAC TC - #C CGC GTG TGC CCG TCG      576     Pro Arg Tyr Met Val Phe Ala Cys Ser Asp Se - #r Arg Val Cys Pro Ser     #           845     - GTG ACA CTG GGA CTG CAG CCC GGC GAG GCA TT - #C ACC GTC CGC AAC ATC      624     Val Thr Leu Gly Leu Gln Pro Gly Glu Ala Ph - #e Thr Val Arg Asn Ile     #       860     - GCT TCC ATG GTC CCA CCC TAC GAC AAG ATC AA - #G TAC GCC GGC ACA GGG      672     Ala Ser Met Val Pro Pro Tyr Asp Lys Ile Ly - #s Tyr Ala Gly Thr Gly     #   875     - TCC GCC ATC GAG TAC GCC GTG TGC GCG CTC AA - #G GTG CAG GTC ATC GTG      720     Ser Ala Ile Glu Tyr Ala Val Cys Ala Leu Ly - #s Val Gln Val Ile Val     880                 8 - #85                 8 - #90                 8 -     #95     - GTC ATT GGC CAC AGC TGC TGC GGT GGC ATC AG - #G GCG CTC CTC TCC CTC      768     Val Ile Gly His Ser Cys Cys Gly Gly Ile Ar - #g Ala Leu Leu Ser Leu     #               910     - AAG GAC GGC GCG CCC GAC AAC TTC ACC TTC GT - #G GAG GAC TGG GTC AGG      816     Lys Asp Gly Ala Pro Asp Asn Phe Thr Phe Va - #l Glu Asp Trp Val Arg     #           925     - ATC GGC AGC CCT GCC AAG AAC AAG GTG AAG AA - #A GAG CAC GCG TCC GTG      864     Ile Gly Ser Pro Ala Lys Asn Lys Val Lys Ly - #s Glu His Ala Ser Val     #       940     - CCG TTC GAT GAC CAG TGC TCC ATC CTG GAG AA - #G GAG GCC GTG AAC GTG      912     Pro Phe Asp Asp Gln Cys Ser Ile Leu Glu Ly - #s Glu Ala Val Asn Val     #   955     - TCG CTC CAG AAC CTC AAG AGC TAC CCC TTC GT - #C AAG GAA GGG CTG GCC      960     Ser Leu Gln Asn Leu Lys Ser Tyr Pro Phe Va - #l Lys Glu Gly Leu Ala     960                 9 - #65                 9 - #70                 9 -     #75     - GGC GGG ACG CTC AAG CTG GTT GGC GCC CAC TC - #A CAC TTC GTC AAA GGG     1008     Gly Gly Thr Leu Lys Leu Val Gly Ala His Se - #r His Phe Val Lys Gly     #               990     - CAG TTC GTC ACA TGG GAG CCT CCC CAG GAC GC - #C ATC GAG CGC TTG ACG     1056     Gln Phe Val Thr Trp Glu Pro Pro Gln Asp Al - #a Ile Glu Arg Leu Thr     #          10050     - AGC GGC TTC CAG CAG TTC AAG GTC AAT GTC TA - #T GAC AAG AAG CCG GAG     1104     Ser Gly Phe Gln Gln Phe Lys Val Asn Val Ty - #r Asp Lys Lys Pro Glu     #      10205     - CTT TTC GGG CCT CTC AAG TCC GGC CAG GCC CC - #C AAG TAC ATG GTG TTC     1152     Leu Phe Gly Pro Leu Lys Ser Gly Gln Ala Pr - #o Lys Tyr Met Val Phe     #  10350     - GCC TGC TCC GAC TCC CGT GTG TGC CCG TCG GT - #G ACC CTG GGC CTG CAG     1200     Ala Cys Ser Asp Ser Arg Val Cys Pro Ser Va - #l Thr Leu Gly Leu Gln     #               10551045 - #                1050     - CCG GGC GAG GCC TTC ACC GTT CGC AAC ATC GC - #C GCC ATG GTC CCA GGC     1248     Pro Gly Glu Ala Phe Thr Val Arg Asn Ile Al - #a Ala Met Val Pro Gly     #              10705     - TAC GAC AAG ACC AAG TAC ACC GGC ATC GGG TC - #C GCC ATC GAG TAC GCT     1296     Tyr Asp Lys Thr Lys Tyr Thr Gly Ile Gly Se - #r Ala Ile Glu Tyr Ala     #          10850     - GTG TGC GCC CTC AAG GTG GAG GTC CTC GTG GT - #C ATT GGC CAT AGC TGC     1344     Val Cys Ala Leu Lys Val Glu Val Leu Val Va - #l Ile Gly His Ser Cys     #      11005     - TGC GGT GGC ATC AGG GCG CTC CTC TCC CTC CA - #A GGA ACC GGC GCA GCC     1392     Cys Gly Gly Ile Arg Ala Leu Leu Ser Leu Gl - #n Gly Thr Gly Ala Ala     #  11150     - TAC ACC TTC CAC TTC GTC GAG GAC TGG GTT AA - #G ATC GGC TTC ATT GCC     1440     Tyr Thr Phe His Phe Val Glu Asp Trp Val Ly - #s Ile Gly Phe Ile Ala     #               11351125 - #                1130     - AAG ATG AAG GTA AAG AAA GAG CAC GCC TCG GT - #G CCG TTC GAT GAC CAG     1488     Lys Met Lys Val Lys Lys Glu His Ala Ser Va - #l Pro Phe Asp Asp Gln     #              11505     - TGC TCC ATT CTC GAG AAG GAG GCC GTG AAC GT - #G TCC CTG GAG AAC CTC     1536     Cys Ser Ile Leu Glu Lys Glu Ala Val Asn Va - #l Ser Leu Glu Asn Leu     #          11650     - AAG ACC TAC CCC TTC GTC AAG GAA GGG CTT GC - #A AAT GGG ACC CTC AAG     1584     Lys Thr Tyr Pro Phe Val Lys Glu Gly Leu Al - #a Asn Gly Thr Leu Lys     #      11805     - CTG ATC GGC GCC CAC TAC GAC TTT GTC TCA GG - #A GAG TTC CTC ACA TGG     1632     Leu Ile Gly Ala His Tyr Asp Phe Val Ser Gl - #y Glu Phe Leu Thr Trp     #  11950     - AAA AAG TGAAAAACTA GGGCTAAGGC AATTCTACCG GCCCGCCGAC TC - #TGCATCAT     1688     Lys Lys     1200     - CATAATATAT ATACTATAAC TATACTACTA GCTACCTACC GATAGTCACC CG - #AGCAATGT     1748     - GAATGCGTCG AGTACTATCT GTTTTCTGCA TCTACATATA TATACCGGAT CA - #ACAATCGC     1808     - CCAATGTGAA TGTAATAAGC AATATCATTT TCTACCACTT TTCATTCCTA AC - #GCTGAGGC     1868     - TTTTTATGTA CTATATCTTA TATGATGAAT AATAATATGA CCGCCTTGTG AT - #CTAAAAAA     1928     #        1935     __________________________________________________________________________ 

We claim:
 1. An isolated and purified DNA encoding the amino acid sequence according to SEQ ID NO.
 1. 2. An isolated and purified DNA sequence having the nucleotide sequence according to SEQ ID NO.
 2. 3. An isolated and purified DNA encoding the amino acid sequence according to SEQ ID NO.
 4. 4. An isolated and purified DNA sequence having the nucleotide sequence according to SEQ ID NO.
 5. 5. An isolated and purified DNA encoding the amino acid sequence according to SEQ ID NO.
 6. 6. An isolated and purified DNA sequence having the nucleotide sequence according to SEQ ID NO.
 7. 7. An isolated and purified DNA encoding the amino acid sequence according to SEQ ID NO.
 8. 8. An isolated and purified DNA sequence having the nucleotide sequence according to SEQ ID NO.
 9. 